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Abstract 

Background: Recent advancements in next-generation sequencing technology have enabled cost-effective 
sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. 
Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to 
developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To 
enrich genomic resources for Japanese eel {Anguilla japonica), we constructed a ddRAD-based genetic map using an 
Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 1 9 linkage groups 
of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the 
history of genome evolution after the teleost-specific genome duplication. 

Results: We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male 
spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers 
and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the 
current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted 
gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal 
rearrangement events that occurred soon after the teleost-specific genome duplication. 

Conclusions: The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing 
allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled 
sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with 
economically important traits and for investigating comparative genomics of the Japanese eel. 
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Background 

Next-generation sequencing platforms permit for the 
cost-effective sequencing of whole or partial genomes in 
a relatively short time, which is greatly accelerating gen- 
omic research in non-model organisms. In particular, 
single nucleotide polymorphism (SNP) genotyping by 
restriction-site associated DNA sequencing (RAD-seq) 
is a powerful approach for developing a large number of 
genetic markers, for constructing a high-density genetic 
map and for studying the genetic architecture of com- 
plex traits [1-6]. This approach sequences short targets 
that immediately flank sites for a particular restriction 
enzyme throughout a genome; this produces a reduced 
representation of the genome, allowing oversequencing 
of the nucleotides next to the restriction sites and 
detection of SNPs. This approach can rapidly and cost- 
effectively discover SNPs compared with whole-genome 
sequencing (reviewed in [7]); nevertheless, a massive 
amount of sequence data is required to increase the read 
depth for more accurate SNP calling. Therefore, in previous 
studies, RAD-seq has generally been performed using Illu- 
mina sequencing platforms, such as the Genome Analyzer 
IIx (GAIIx) and HiSeq series [1-6]. Recently, double-digest 
RAD-seq (ddRAD-seq) has been developed as a simple and 
robust method [8]. Unlike the RAD-seq, this approach uses 
two restriction enzyme comprising a rare-cutting one and 
frequently-cutting one and avoids random shearing of the 
DNA. Such precise and repeatable size selection allows 
stable shared region recovery across samples (or individ- 
uals). Thus, ddRAD-seq produces more effective SNP 
genotyping compared with RAD-seq, even using a rela- 
tively small number of reads generated by a small next- 
generation benchtop sequencer, such as an Ion Torrent 
Personal Genome Machine (Life Technologies Corpor- 
ation, Carlsbad, CA, USA), which was launched in early 
2011 [9]. This platform performs semiconductor sequen- 
cing that relies on the detection of pH changes induced by 
the release of a proton upon the incorporation of a nu- 
cleotide during strand synthesis [10] and generates over 
500 million reads per run at 4.5 hours. The Ion PGM is a 
relatively new technology; therefore, its output, including 
quality and downstream application, has not yet fully ma- 
tured [10]. However, as the sequencing cost (price per 
base) continues to drop, the platform will soon be applic- 
able for small and medium-size laboratories [11]. 

The Japanese eel (Anguilla japonica) is one of the most 
widely cultivated fish species in East Asian countries; how- 
ever, the farming of this species has been completely 
dependent on seeds from nature, and broodstock domesti- 
cation on a commercial scale has yet to be fully developed. 
Aiming for stable production of this species, the first tech- 
niques for artificial breeding were established in 1970s 
[12,13]. Thereafter, research has improved these tech- 
niques [14], and recently we succeeded in obtaining an F 2 



generation of this species (unpublished results). Although 
further improvements are required for large-scale commer- 
cial production, the broodstock domestication and select- 
ive breeding, including a DNA marker-assisted selection 
based on quantitative trait loci (QTLs), is now within 
reach. A key step in such a genetic improvement scheme 
is an accumulation of genomic resources such as a draft 
genome sequence, transcriptome sequences and a genetic 
map. Recently, the draft genome of the Japanese eel was 
sequenced using Illumina platforms, yielding an assembly 
comprising 323,776 scaffolds and covering 1.15 Gb of the 
genome; gene prediction and annotation for this assembly 
are ongoing [15]. In addition, we constructed the first gen- 
etic linkage map of this species [16]. Integration of a gen- 
etic map and assembled sequences enable us to identify a 
chromosomal order of scaffolds in a draft assembly and 
provides a valuable resource for fine mapping and pos- 
itional cloning of QTLs associated with economically im- 
portant traits [17,18]. However, in the Japanese eel, because 
of the low resolution of the genetic map, the chromosomal 
order of scaffolds in the current assembly is unknown, 
which has limited the use of the genomic sequences. 

In addition to assisting in genetic studies, a genome 
map integrating a genetic map and assembled sequences 
is also a powerful tool for investigating comparative gen- 
omics [19,20]. Eel species are one of the most distantly re- 
lated groups to other teleost species and has diverged 
from the other teleosts in the basal lineage of the teleosts 
[21,22]. Therefore, comparing the genome of the Japanese 
eel with those of other fishes would provide an opportun- 
ity to understand the genomic architecture of ancestral 
teleosts and chromosomal rearrangement events that oc- 
curred during teleost genome evolution. Teleost genomes 
have experienced a teleost- specific whole genome duplica- 
tion event (TGD) [23,24], which is thought to have trig- 
gered an acceleration of chromosomal rearrangements 
[4,25,26]. Indeed, Kasahara et al. [19] inferred that eight 
major chromosomal rearrangements occurred in the an- 
cestral lineage of zebrafish (Danio rerio) and medaka (Ory- 
zias latipes) after the TGD. In addition, the genome of the 
spotted gar (Lepisosteus oculatus), whose lineage diverged 
from the teleost lineage before the TGD, showed that the 
extent of conserved synteny between the gar and human 
(Homo sapiens) was higher than that between the gar and 
zebrafish, and the gar and the stickleback [G aster osteus 
aculeatus), suggesting that the TGD accelerated the loss of 
ancestral syntenies [4]. Given the phylogenic position 
among these fishes, the TGD and subsequent chromo- 
somal rearrangements are presumed to have occurred 
over a relatively short period [19]. The eel lineage di- 
verged from other teleost lineage within this period 
[21,22], Therefore, comparative genomics between the 
Japanese eel and model fishes provides new insights into 
the genome evolution soon after the TGD. 
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Here, we generated a ddRAD-based genetic map of 
the Japanese eel using an Ion PGM sequencer. The map 
comprised 19 linkage groups with a total of 2,672 SNP 
markers and 115 Simple Sequence Repeat (SSR) markers 
and was integrated with the current genome assembly to 
identify the chromosomal order of the scaffolds. In the 
current assembly, 151 Mb of the scaffolds has been an- 
chored on the 19 linkage groups, and 70 Mb of them 
have been ordered on the linkage groups. Comparisons 
of the genomes between the Japanese eel and model 
fishes showed a highly conserved synteny among teleosts 
and revealed the history of genome evolution in the 
basal lineage of the teleosts. 

Methods 

Mapping population 

Forty individuals of a wild-caught glass eel (Average of 
total length = 56.9 mm, body weight = 185.3 mg) of the 
Japanese eel were purchased from commercial fisheries 
in Japan. The sex ratio of this species is skewed towards 
the male under ordinary culture conditions [27,28]. 
Therefore, half of them were feminized by feeding a diet 
containing estradiol- 17 (3 for 5 months [29], and culti- 
vated fishes were used as a dam; the remaining male fish 
were not treated. These female and male eels were culti- 
vated in freshwater for 2-3 years and then acclimated to 
seawater. Hormonal treatment was carried out for arti- 
ficial maturation, and a mapping population was ob- 
tained by in vitro crossing, as described previously 
[16,30,31]. Ninety-two full-sib progeny from the single 
cross were used to develop a linkage map. This project 
was conducted in accordance with the Guidelines for Ani- 
mal Experimentation of the National Research Institute of 
Aquaculture (NRIA), and all animal procedures were 
approved by the Institutional Animal Care and Use 
Committee of the NRIA. 

ddRAD library preparation and sequencing 

Muscle tissues of the parents were preserved in TNES 
urea buffer (10 mM Tris-HCl, pH 7.5, 125 mM NaCl, 
10 mM EDTA, 1% sodium dodecyl sulfate, 8 M urea) at 
room temperature until DNA preparation. Genomic DNA 
was extracted from the stored samples using the standard 
phenol-chloroform technique. Genomic DNA of 198- 
457 days post-hatch juveniles (glass eels) was isolated from 
muscle tissues using the QuickGene-800 extraction plat- 
form (Fujifilm, Tokyo, Japan) with a QuickGene DNA tis- 
sue kit S (DT-S). Our ddRAD libraries were constructed 
using a method of combining a normal ddRAD-seq proto- 
col [8] with an amplicon tagging protocol for Access Array 
technology (Fluidigm, South San Francisco, CA, USA) 
(also see Additional file 1: Figure SI). Briefly, 500 ng of 
DNA template from each individual was double-digested 
with BamHI-HF (10 units per reaction, New England 



Biolabs Inc., Beverly, MA, USA) and Mspl (10 units per 
reaction, New England Biolabs Inc.) at 37°C for 1 h. We 
found approximately 100,000 cut sites of BamHI-HF 
(G|GATCC) and -1,500,000 of Mspl (C|CGG) in the 
current genome assembly of the Japanese eel [15]. Each 
fragmented sample was purified using the MinElute 
PCR purification kit (Qiagen GmbH, Hilden, Germany), 
eluted with 30 \iL of TE buffer (pH 8.0). Fragments were 
size-selected using the Pippin Prep (2% agarose cart- 
ridge, Sage Science, Beverly, MA, USA) under a "broad" 
setting with a mean of 150 bp and range of 120-180 bp. 
After size-selection, each sample was purified using 
Agencourt AMPure XP beads (Beckman Coulter Inc., 
Brea, CA, USA). Purified samples were then ligated to 
CS1 -tagged and CS2-tagged adapters, consisting of the 
common sequence tag 1 (CS1) and the common se- 
quence tag 2 (CS2) of Access Array, respectively. The 
CS1 -tagged adapter binds to overhangs generated by 
BamHI, and the CS2-tagged adapter contains overhangs 
compatible with an Mspl site. Adapter ligation was per- 
formed in a 40- (iL reaction volume containing 25 \iL of 
DNA samples, 2 \iL of CSl-tagged adapter (100 nM), 

4 \iL of CS2-tagged adapter (100 nM), 1 \iL of T4 ligase 
(1 U/(iL, Invitrogen, Carlsbad, CA, USA) and 8 \iL of 

5 x T4 ligation buffer (Invitrogen). The reaction was 
incubated at room temperature for 1 h, and heat- 
inactivate The T4 ligase was then heat inactivated at 65°C 
for 30 min. After adapter ligation, each DNA samples was 
purified using Agencourt AMPure XP beads and ampli- 
fied using KAPA HiFi polymerase (KAPA biosystems, 
Woburn, MA, USA) with 400 nM primers of Access 
Array Barcode Library for Ion Torrent PGM Sequencer- 
96 (Fluidigm). The following PCR protocol was used: 
initial denaturation at 95°C for 5 min; eight cycles of 95°C 
for 15 sec, 60°C for 30 sec and 72°C for 1 min; followed 
by a final extension period at 72°C for 3 min. The amp- 
lified libraries were purified using Agencourt AMPure 
XP beads and quantified using the Qubit fluorometer 
with a Quant-it dsDNA HS kit (Invitrogen). The librar- 
ies of the parents and the 92 progeny were pooled (two 
individuals per library for the parents and 6-16 individ- 
uals per library for the progeny). The quality, size and con- 
centration of the pooled libraries were finally determined 
using the 2100 Bioanalyzer with a high sensitivity DNA 
chip (Agilent technologies, Waldbronn, Germany), and 
the library was diluted before template preparation. Emul- 
sion PCR and Ion Sphere Particles enrichment were car- 
ried out using the Ion OneTouch system with an Ion 
OneTouch 200 Template kit v2 DL (Life Technologies 
Corporation). The 200 template samples were sequenced 
on 318 chips using the PGM platform with an Ion PGM 
200 sequencing kit (Life Technologies Corporation). Data 
from the PGM runs were processed using the Ion Torrent 
Suite 3.4.2 software to generate sequenced reads. 
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Creation of the reference sequences and read mapping 

Raw reads were filtered to discard those of low quality and 
sorted to individuals according to the barcode sequences of 
the Access Array using the process_radtags program in 
Stacks [32]. The processes were conducted using the fol- 
lowing options: restriction enzyme of BamHI, rescue bar- 
codes and RAD-tags, discard reads with low quality scores, 
remove any read with an uncalled base and truncate final 
read length of 100 bp. To generate reference sequences for 
SNP discovery, the processed ddRAD-tags derived from 
the parents were clustered using USEARCH [33] with an 
identity threshold of 0.75. Centroid sequences derived from 
the clusters (the cluster size was >50, were used for read 
mapping as reference sequences. The processed reads de- 
rived from both of the parents and the 92 progeny were 
mapped to the reference sequence using CLC Genomics 
Workbench (CLC Bio, Aarhus, Denmark) with the follow- 
ing parameters: mismatch cost of 2, insertion cost of 3, de- 
letion cost of 3, length fraction of 0.95, similarity of 0.95 
and non-specific match handling of ignore. The output 
BAM files of each individual were sorted and indexed 
using SAMtools [34]. 

SNP calling and genotyping 

SNPs were called using Unified Genotyper in the Gen- 
ome Analysis Toolkit (GATK) [35], and the variants 
were filtered as follows: cluster window size of 10, 
quality of depth (QD) of <5 and genotype quality (GQ) 
of <30. For each given SNP site, the genotypes were la- 
beled as "A" for a homozygous genotype of reference 
alleles, "B" for homozygous genotype of alternate non- 
reference alleles, and "H" for the heterozygous geno- 
type. The low quality (QD < 5) and clustered SNP sites 
were removed from the genotype data, and the geno- 
types having lower reliability (GQ < 30) were replaced 
by a missing value. 

Genotyping for SSR markers 

To clarify the correspondence between the previous 
linkage map [16] and the present map, we mapped SSR 
markers located in the previous map to the present map. 
In addition, we developed novel SSR markers using a 
Roche 454 GS FLX Titanium platform. We constructed 
a shotgun library derived from a single individual of the 
Japanese eel and performed 454-pyrosequencing accord- 
ing to the manufacturers protocol (Roche 454 Life Sci- 
ences, Branford, CT, USA). De novo assembly of the 
processed reads was performed using the Newbler as- 
sembler 2.6 with default parameters. Auto-primer [36] 
was used to identify SSRs and design primer from the 
resultant contigs. PCR reactions and genotyping were 
performed as previously described [16]. 



Linkage map construction 

To construct a linkage map of the Japanese eel, hetero- 
zygous markers in either one of the parents were used as 
a pseudo-testcross [37]. Linkage analyses were con- 
ducted for markers genotyped in at least 88 of 92 (>95%) 
individuals using the R/qtl package [38]. Markers show- 
ing significant Mendelian segregation distortion (j(2 test, 
P < 0.001, d.f. = 1) were excluded. For all markers, pair- 
wise recombination estimates and logarithm of odds 
(LOD) scores were calculated, and marker pairs were then 
assigned to linkage groups at a LOD threshold of 7.0. 
Local marker order was determined by multipoint ana- 
lysis. Map distances (cM) were calculated using the 
Kosambi function. Following the initial mapping, potential 
genotyping errors that may be identified through apparent 
tight double-crossovers were calculated, and a doubtful 
genotype having an error LOD of >2 was replaced by a 
missing value. Subsequently, a linkage map was recon- 
structed using the error corrected dataset, including 
markers genotyped in at least 88 of 92 (>95%) individuals. 
The resultant map was drawn using the perl script of 
genetic_mapper revision 0.4 [39], with a modification of 
color images. The corrected length of the linkage map (L) 
was estimated by multiplying the length of each linkage 
group by (m + l)/{m - 1), where m is the number of 
markers in the linkage group [40]. The genome coverage 
of the linkage map (c) was next estimated by calculating 
c = 1 - e ~ 2dn/L , where d is the average interval of markers, 
n is the number of markers, and L is the length of the link- 
age map estimated above. 

Anchoring scaffolds in the Japanese eel genome 
assembly to the linkage map 

To identify the chromosomal order of scaffolds in the 
current genome assembly of the Japanese eel, we anchored 
the scaffolds to the linkage map using RAD-tag sequences 
for SNP markers and contig sequences for SSR markers. 
The eel draft sequence was obtained from the eel genome 
website [41]. To find specific scaffolds associated with the 
sequences of SNP and SSR markers, these sequences were 
BLAST searched against the draft genome data with a cut- 
off £-value of le-20. In cases where the query hit two or 
more scaffolds with a less than two-fold difference in the 
bit score, we did not assign any scaffold. The relative order 
between anchored scaffolds on each chromosome was de- 
termined by the following rule. The order of scaffolds was 
initially determined in accordance with the order of the 
anchor points to scaffolds on the female linkage map. If 
two or more scaffolds had the same anchor point, the lar- 
gest scaffold was used for ordering. Some scaffolds had 
the same anchor point on female map while their anchor 
points on male map were genetically separated. In such 
cases, if the sum length of these scaffolds was longer than 
that of a scaffold whose length was the longest in the same 
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anchor point, these scaffolds were ordered according to 
the male map. Otherwise the largest scaffold was ordered. 
In the present study, the scaffolds anchored only to the 
male map were not used for ordering, because the relative 
order between these scaffolds and that anchored only to 
the female map was indeterminate. 

Sequence comparison 

To examine the conservation of syntenies between the 
Japanese eel and zebrafish, and the Japanese eel and me- 
daka, we downloaded repeat-masked genomic sequences 
of zebrafish and medaka from the UCSC Genome 
browser [42]. We then excluded sequences that have not 
been mapped on chromosomes in each species from the 
dataset. For the Japanese eel, the sequences of scaffolds 
anchored to chromosomes (linkage groups) were used 
for sequence alignments. Before alignment, repetitive el- 
ements in the anchored scaffolds were masked using 
RepeatMasker [43]. The anchored scaffold sequences of 
individual chromosomes were firstly brought together 
into single-FASTA files with gap sequences of (N) 10>000 
between the scaffolds. This was to prevent a misdetec- 
tion of a conserved sequence segment that lies astride 
two different scaffolds in the alignment processes. Then, 
the single-FASTA files of individual chromosomes were 
integrated into a multi-FASTA file. To detect triplets of 
conserved sequence segments among the three species, 
multiple genome alignments were performed using the 
progressiveMauve aligner [44], with the following pa- 
rameters: seed weight of 20 and use a family of spaced 
seeds. Data for the conserved sequence segments in the 
output backbone files were used for constructing Oxford 
grids. To identify homologous regions in the spotted gar 
genome, the eel sequences associated with the conserved 
segments were searched against the spotted gar draft 
genome assembly using a BLAT program at the Pre 
Ensembl website [45], with default parameters. In cases 
where the query matched two or more chromosomal re- 
gions with a less than two-fold difference in the bit 
score, the match was considered insignificant. 

Results 

Double-digest RAD sequencing using Ion PGM platform 

Ion PGM sequencing generated 6.7 million raw reads 
from the mapping parents (Additional file 2: Table SI). 
After trimming the CSl-tgged and CS2- tagged adapters 
and filtering the low quality reads, 2,287,121 and 
2,310,898 ddRAD-tags were obtained from the dam and 
sire, respectively. These tags from the parents were 
clustered into 9,634 RAD loci, of which the centroid se- 
quences were used as reference sequences for SNP dis- 
covery. The reference sequences of the RAD loci are 
listed in Additional file 3. Then, Ion PGM sequencing 
yielded 96.3 million raw reads from the 92 progeny 



(Additional file 2: Table SI). After read processing, ap- 
proximately 780,000 (SD: 194,262) tags were obtained per 
individual. All ddRAD-tags obtained from the parents and 
the progeny were submitted to the CLC Genomics Work- 
bench and mapped to the reference sequences, resulting 
in 1,891,691 tags being mapped in the dam (on average, 
195.6x coverage per locus), 1,943,489 tags in the sire 
(201.1x), and approximately 600,000 (SD: 145,465) tags in 
each progeny (62.5x, SD: 15.0). The raw reads of the 
ddRAD-seq have been deposited in the DDBJ Sequence 
Read Archive (DRA) under accession number [DDBJ: 
DRA001739]. 

SNP detection and genotyping 

SNP detection and genotyping were performed using the 
Unified Genotyper in the GATK with default options, 
resulting in the identification of 14,397 biallelic SNP 
sites in the 9,634 loci. According to QD annotation, the 
SNP sites having low probability were excluded from the 
dataset. Additionally, the clustered SNP sites, which are 
often generated by misalignment in repetitive or homo- 
polymeric regions, were also excluded from the dataset. As 
a result, 10,943 SNP sites passed through the filtration and 
were used as putative markers for constructing the linkage 
map. Among them, 2,694 markers were heterozygous in ei- 
ther one of the parents and were genotyped for sufficient 
numbers of progeny. Of these markers, 23 markers were 
significantly distorted from a Mendelian segregation ratio 
(1:1 in the backcross model) and were excluded from the 
dataset. The remaining 2,671 markers were informative 
and used for constructing the linkage map. Details of SNP 
detection are shown in Additional file 2: Table S2. 

SSR genotyping 

To clarify the correspondence between the previous 
linkage map [16] and the present map, we chose 106 
SSR markers that were distributed genome-wide in the 
previous map. Additionally, to increase the number of 
SSR markers, a shotgun library was sequenced using a 
Roche /454 GS FLX Titanium platform, which generated 
1,005,279 reads (average length = 254 bp) representing 
255,755,881 bp. The raw sequencing data produced in 
this work have been deposited in the DRA under acces- 
sion number [DDBJ:DRA001189]. After trimming the 
adapters and filtering the low quality and small reads, 
the high quality reads were assembled into 10,502 con- 
tigs (Average length of 374 bp, N 50 length of 456 bp). 
After detecting SSRs from the contigs, we developed 144 
SSR markers using previously described methods [16]. 
Taken together, a total of 250 SSR markers were used for 
genotyping. To obtain informative markers for the map- 
ping family, we examined the segregation type of these 
SSR loci in the parents, which showed that 124 markers 
exhibited polymorphisms. These markers were scored in 
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the 92 progeny and tested for segregation distortion. In 
total, 115 markers were informative for the mapping fam- 
ily and were used to generate the linkage map together 
with the SNP markers. Details of primer sequences and 
DDBJ accessions [DDBJ:AB500964, AB860330-AB860372] 
are shown in Additional file 2: Table S3. 

A second-generation linkage map of the Japanese eel 

In a backcross model for a linkage analysis, a marker segre- 
gation type of "B/H" is used as a synonym of that of "A/H". 
We therefore converted the genotype "B" to "A" in the 
genotype data before linkage analyses. The genotype data 
for female and male linkage maps are shown in Additional 
file 2: Table S4, Table S5. 

The female map comprised 19 linkage groups with a 
total of 1,527 markers (1,428 SNP markers and 99 SSR 
markers) (Figure 1A, Table 1, Additional file 1: Figure S2, 
Additional file 2: Table S6). These 1,428 SNP markers 
were located in 1,004 RAD loci, accounting for 1.5 SNP 
sites per RAD sequence of 100 bp. The total length of the 
linkage groups was 1,748.8 cM, and the length of each 
linkage group ranged from 52.9 to 139.2 cM (mean 
92.0 cM). The number of markers per linkage group var- 
ied from 32 to 108, with an average of 80 markers per 
linkage group, and the average distance spacing between 
two markers was 3.6 cM. 

The male map initially consisted of 20 linkage groups in- 
cluding a total of 1,346 markers (1,243 SNP markers and 
103 SSR markers) at the LOD threshold of 7.0 (data not 
shown). These 1,243 SNP markers were positioned in 933 
RAD loci, accounting for 1.3 SNP site per RAD sequence. 
Of all the SNP markers on both maps, 574 markers in fe- 
male and 555 markers in male were co-located within 396 
common RAD loci. Using these SNP markers and SSR 
markers that were heterozygous in both parents, we iden- 
tified homologous pairs of linkage groups of the female 
and male. Eighteen pairs of linkage groups exhibited one- 
to-one correspondence between the female and male 
maps, whereas two linkage groups in the male were 
homologous to linkage group 14 in the female. More- 
over, markers located on the two linkage groups in the 
male were linked with a LOD threshold of 4.0. We 
therefore considered the two linkage groups as a single 
linkage group. Subsequently, the male map comprised 
19 linkage groups (Figure IB, Table 1, Additional file 1: 
Figure S3. Additional file 2: Table S6). This map spans 
1,294.5 cM, and the size of each linkage group ranged 
from 19.6 to 99.5 cM (mean 68.1 cM). The number of 
markers per linkage group varied from 22 to 127, with 
an average of 71 markers per group, and the average 
distance between two markers was 4.1 cM. 

Based on the homologous pairs of linkage groups of 
the female and male, the resultant map consisted of 19 
linkage groups including a total of 2,787 markers 



(Table 1). In this map, all informative markers showed 
detectable linkage to at least one other marker. The total 
genetic distances of the female and male maps were 
1,748.8 and 1,294.5 cM, respectively. Using the method 
of Chakravarti et al. [40], the corrected lengths of the 
maps were estimated at 1,890.0 cM and 1457.0 cM, 
which were converted into genome coverages of 85.4% 
and 84.8%, respectively. 

Integration of the genetic map and the assembled 
sequences 

Targeting markers to specific regions of the scaffolds en- 
abled us to identify the chromosomal order of the scaf- 
folds in the Japanese eel genome assembly. Using BLAST 
searches, 2,260 of 2,672 SNP markers and 82 of 115 SSR 
markers sequences were assigned to 1,252 scaffolds cover- 
ing 151 Mb (13% of the current assembly) (Figure 1C, 
Table 1, Additional file 1: Figure S4, Additional file 2: 
Table S6). Among them, 483 scaffolds covering 70 Mb 
(6%) were ordered to be consistent with the order of 
markers determined on the linkage maps. However, for 
most of the scaffolds, we were unable to identify their 
orientation on the linkage groups, because their sequence 
lengths of them were too short (average length was 
145 kb), and genetically separated markers within the scaf- 
fold were rarely observed. 

Syntenic relationships between the Japanese eel and 
model fishes 

Anchoring the genomic sequences to the chromosomes 
(linkage groups) of the Japanese eel made it possible 
to compare the conservation of synteny between the 
Japanese eel and other fishes. To compare the extent of 
conserved synteny among teleosts, we firstly identified 
745 triplets of conserved sequence segments among the 
Japanese eel, zebrafish and medaka (Additional file 2: 
Table S7). These segments were arrayed in Oxford grids 
according to the chromosomes of each species (Figure 2). 
For comparison of the eel-zebrafish, the oxford grid 
showed that 582 of 745 (78%) segment pairs fell in 44 syn- 
tenic boxes, each comprising six or more segment pairs 
(Figure 2A). Moreover, 297 (40%) segment pairs were 
mapped into 15 syntenic boxes that had at least 16 seg- 
ment pairs. Boxes with high number of segment pairs in- 
dicate a high degree of conserved synteny between the eel 
and zebrafish. In contrast, 163 (22%) segment pairs did 
not fall into syntenic boxes. This represents a trace of in- 
terchromosomal rearrangements after they diverged from 
their last common ancestor. For comparison of the eel- 
medaka, 637 of 745 (86%) conserved sequence segment 
pairs fell in a total of 41 syntenic blocks (Figure 2B). 
Among them, 392 (53%) segment pairs were contained 
in 17 highly syntenic boxes, while 108 (15%) segment 
pairs did not fall into syntenic boxes. These results 
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Figure 1 Linkage maps of the Japanese eel and integration of the map and the assembled sequences. (A) Female linkage map. Bars on 
each linkage group represent the loci of mapped SNP and SSR markers. Genetic distances (cM) were calculated using the Kosambi function. 
(B) Male linkage map. Detailed maps of female and male are shown in Additional file 1: Figure S2 and Figure S3. An asterisk shows that two 
marker loci are linked with a LOD threshold of 4.0. (C) Integration of the linkage map and the assembled sequences for LG1. Vertical red and blue 
bars represent female and male linkage group, respectively. SNP markers are black, and SSR markers are red. Boxes sandwiched between the 
female and male linkage groups represent scaffolds. Scaffolds for which the relative order on each chromosome has been determined are 
labeled in green. A scaffold that is mapped on other linkage groups is labeled in gray. This inconsistency could be caused by a misassembly in 
the current assembly. Transverse lines link the location of each marker on the genetic maps with its corresponding anchored scaffolds. Gaps 
between scaffolds are arbitrary because the actual distance between scaffolds is unknown. Scale bars represent 10 cM and 500 kb. Details of all 
chromosomes and anchored scaffolds are shown in Additional file 1: Figure S4. 



suggest that the extent of conserved synteny between 
the eel and medaka is greater than that between the eel 
and zebrafish. As expected the degree of synteny be- 
tween zebrafish and medaka was highly conserved 
compared with that between the eel and zebrafish, and 
the eel and medaka (Additional file 1: Figure S5). 

We next analyzed the syntenic conservation between 
the genomes of the Japanese eel and the spotted gar, 
which is a model species that serves as an outgroup for 
exploring the mechanisms of evolution after the TGD 
[4]. The eel sequences derived from the 745 conserved 
sequence segments were searched against the gar draft 
genome assembly using the BLAT program, resulting in 
the 745 sequences of the eel being assigned to 453 



unique sequences of the gar (Additional file 2: Table S7). 
Of these, 437 sequences have been mapped on the chro- 
mosomes of the gar. The oxford grid between the eel 
and gar showed that 295 of 437 (68%) putative ortholo- 
gous sequence pairs fell in 31 syntenic boxes, each com- 
prising of six or more ortholog pairs, while 142 (32%) 
ortholog pairs did not fall into syntenic boxes (Figure 2C). 
Although there were many traces of interchromosomal 
rearrangements between the two species, the analysis 
showed some one-to-two correspondences between 
these genomes. For example, most of the eel orthologs 
mapped on gar chromosome 21 (Loc21) were found on 
eel linkage group 3 (Aja3, 10 of 17 orthologs) and Ajal6 
(6 of 17 orthologs). In the Oxford grid between the eel 
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and medaka, Aja3 and Ajal6 had an orthologous rela- 
tionship to medaka chromosome 12 (01al2) and 01a9, 
respectively, which were previously shown to be paralo- 
gous derived by the TGD [19]. Thus, Aja3 and Ajal6 are 
likely to be a paralogous chromosome pair generated by 
the TGD. One-to-two relationships between the eel and 
gar genomes were also observed in Loc4 and Aja2/Aja9, 
Loc9 and Aja4/Aja5, Locll and Aja4/Aja7, Locl4 and 
Aja5/Ajal9, Locl7 and Aja8/Ajal4, and Loci and Aja7/ 
Ajal9. These pairs of eel linkage groups (or fragmented 
regions of them) also may possibly be paralogous pairs 
generated by the TGD. 

Discussion 

Double-digest RAD sequencing using the next-generation 
benchtop sequencer 

We combined the ddRAD-seq approach with Ion PGM 
sequencing to perform high-throughput SNP genotyp- 
ing. To the best of our knowledge, this is the first report 
of constructing a highly dense genetic map using the Ion 
PGM platform. Compared with the RAD-seq approach, 
the ddRAD-seq permits greater robustness in region re- 
covery [8]. Indeed, our simulation showed that 87-88% 
of 150,000,000 ddRAD-tags were sufficient to recover 



12,190 regions at 117-124x in the ddRAD-seq approach 
(Additional file 2: Table S8). In contrast, 15-16% of 
150,000,000 RAD-tags recovered 9,275 regions at 25- 
26 x in the RAD-seq approach. Details of the comparison 
of the ddRAD-seq and RAD-seq approaches are shown 
in Additional file 4. Although the read throughput of the 
Ion PGM is lower than that of GAIIx and Hiseq se- 
quencers, the cost-effective sequencing and the rapid 
runtime (4.5 hours per run) of this platform facilitates 
routine analysis. We therefore conducted ddRAD se- 
quencing with a relatively small number of multiplexed 
samples (-16 samples were pooled in our study). This 
has the advantage of making the library at low cost, be- 
cause it needs relatively fewer sequence primers (or 
adapters) attaching barcode sequences for individual 
samples. In fact, a maximum of 16 sequence primers 
were required for ddRAD sequencing for the parents 
and 92 progeny in our study (Additional file 2: Table 
SI). Moreover, our libraries were constructed using a 
unique method of combining the normal ddRAD-seq 
protocol with an amplicon tagging protocol that is 
widely used in amplicon sequencing assay (Additional 
file 1: Figure SI). According to the normal RAD-seq or 
ddRAD-seq protocols, the barcode sequences for multi- 
plexed samples are directly attached to the sequence 
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(See figure on previous page.) 

Figure 2 Oxford grids showing conservation of syntenies between the Japanese eel and model fishes. Conserved sequence segments are 
arrayed according to chromosome (or linkage group) for each species. Numbers in boxes indicate the number of the conserved sequence 
segments. (A) Comparison of the Japanese eel and zebrafish. (B) Comparison of the Japanese eel and medaka. (C) Comparison of the Japanese 
eel and the spotted gar. Details of the conserved sequence segments are listed in Additional file 2: Table S7. 
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adapters. Therefore, if a reanalysis is conducted with a 
change in restriction enzyme pair to obtain other SNP 
makers, the sequence adapters need to be redesigned for 
individual samples. In contrast, our method required 
only a pair of adapters to be changed without redesign- 
ing the sequence primers for individual samples. Thus, 
this method permit flexible working. 

The second-generation genetic map of the Japanese eel 
and its integration with the assembled sequences 

We constructed female and male genetic maps of the 
Japanese eel using double-digest RAD sequencing. The 
map has been improved considerably in the present 
study. While the previous genetic map contained 319 
markers [16], the present map contains 2,672 SNP and 
115 SSR markers. In addition, the present map repre- 
sents a ~2.0-fold increase in map resolution (female: 
3.6 cM, male: 4.1 cM) over the previous map (female: 
7.2 cM, male: 6.3 cM). The correspondence between the 
linkage groups of the previous map and those of the 
present map are shown in Additional file 1: Figure S6. 
The previous map consisted of 22 linkage groups. In the 
present map, linkage group 20 (LG20) of the previous 
map was integrated with LG16. In addition, the linkage 
between the markers located on previous LG21 was dis- 
rupted, and the markers have been mapped on different 
linkage groups (LG3 and LG11 of the present map). The 
correspondences among previous LGs (LG13, LG17 and 
LG22) were unknown because of the absence of SSR 
markers. Consequently, the present map comprises 19 
linkage groups, which corresponds with the karyotypes 
of the Japanese eel (2n = 38) [46,47]. 

The total map lengths were strikingly different between 
the sexes (the corrected map lengths were 1890.0 cM for 
female and 1457.0 cM for the male map). Such a sex dif- 
ference in recombination has been described in various 
animals [48], although the molecular mechanisms of this 
phenomenon remain the subject of debate [49]. Among 
fishes, an overall lower recombination ratio has been iden- 
tified in males of rainbow trout (Oncorhynchus my kiss) 
[50], zebrafish [51], and fugu {Takifugu rubripes) [20]. In 
particular, a strong suppression of recombination has been 
observed around the centromeres in these fishes. In the 
Japanese eel, the marker distribution on the male linkage 
map showed that the apparent recombination deserts were 
located near the center of the chromosome on 10 linkage 
groups: LG1, LG2, LG4, LG5, LG7, LG8, LG9, LG12, 
LG16 and LG19 (Additional file 1: Figure S7A). According 



to the tendency observed in other fishes, this result sug- 
gested that the Japanese eel has at least 10 meta- or sub- 
metacentric chromosomes. To confirm this, we performed 
a cytogenetic observation, as described previously [52]. 
The result showed that the karyotype of this species con- 
sisted of 10 meta- or submetacentric and 9 acrocentric 
chromosome pairs (Additional file 1: Figure S8). Thus, the 
recombination reduction around centromeres occurs dur- 
ing male meiosis of the Japanese eel, and it is likely that 
this phenomenon is widespread among teleosts. In the 
fugu, the recombination deserts around the centromeres 
were observed on not only male but also female; however, 
the suppressed regions on the female map were extremely 
narrow compared with that on the male map [20]. Al- 
though, in the Japanese eel, the recombination on the 
female map is more uniform than that of the male map, 
slight reductions of the recombination were observed 
on LG10 and LG18 (Additional file 1: Figure S7B). In 
addition, the recombination in these regions was also 
relatively suppressed on the male map (Additional file 
1: Figure S7C). Thus, the markers within the recombin- 
ation deserts in LG10 and LG18 of the female map may 
be located very close to the centromeres. 

Using markers targeting specific scaffolds, 1,252 scaf- 
folds in the current genome assembly of the Japanese eel 
were assigned to 19 linkage groups. The high-density gen- 
etic map, to which the genomic sequences are assigned, fa- 
cilitates fine mapping and positional cloning of QTLs 
controlling important traits for eel aquaculture, such as 
growth rate, timing of metamorphosis and disease resist- 
ance (reviewed in [53]). The map will also contribute to 
the identification of candidate genes underlying these 
QTLs by analyzing synteny between this species and 
model fishes. 

Furthermore, integration of the genetic map and the as- 
sembled sequences could improve the genome assembly. 
For instance, in the fugu genome project, 338 Mb (86%) of 
the fugu draft sequence were placed onto 22 chromo- 
somes using 1,220 SSR markers, and the current assembly 
was generated by filling some gaps and by organizing scaf- 
folds into chromosomes based on the genetic map [20]. In 
present study, 151 Mb (13%) of the eel assembled se- 
quence was assigned to the linkage groups using 2,260 
SNP markers and 82 SSR markers. The abundant markers 
were used for anchoring the scaffolds; however, the total 
number of anchored sequences was relatively small com- 
pared with that of fugu. This is because the current draft 
genome sequence of the Japanese eel is more fragmented; 
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the N50 length is 53 kb in the Japanese eel assembly [15] 
compared with 858 kb in the fugu assembly version 4 
(by our calculation). Additionally, we found that some 
markers assigned to the same scaffolds were mapped on 
different linkage groups or distant loci within a linkage 
group (Additional file 1: Figure S4, Additional file 2: 
Table S6). This inconsistency seemed to be caused by 
assembly errors in these scaffolds of the current assem- 
bly. In our view, more effort to sequence the genome of 
the Japanese eel is necessary to identify the order of 
more genomic sequences on the chromosomes based 
on the highly dense genetic map for this species. 

Genome evolution after the TGD 

Teleost genomes have experienced a TGD at the base of 
the teleost lineage [23,24,54] and eight major chromo- 
somal rearrangements that occurred before the divergence 
of the zebrafish and medaka lineages ([19], Figure 3A). Re- 
cent phylogenetic analysis estimated the divergence times 
of 328 million years ago (Mya) for the gar-zebrafish and 
245 Mya for zebrafish-medaka [55]; therefore, the TGD 
and subsequent chromosomal rearrangements are pre- 
sumed to have occurred in a relatively short period of -83 
Mya. The eel lineage diverged from the other teleost line- 
ages within this period (a divergence time of 284 Mya, 
[55]). Hence, comparing the genome of the Japanese eel 
with those of other teleosts and the spotted gar would pro- 
vide an opportunity to understand the history of genome 
evolution soon after the TGD. The medaka genome has 
undergone no major chromosomal rearrangements after 
the divergence of the zebrafish and medaka lineages and 
has preserved the ancestral genomic structures of two spe- 
cies; therefore, the medaka genome would be useful in re- 
vealing the chromosomal rearrangement events that 
occurred in the basal lineage of the teleosts. In contrast, 
the zebrafish genomes have experienced many interchro- 
mosomal rearrangements in the zebrafish-specific lineage 
after separating of the medaka lineage [19,20]. Hence, we 
focused on the comparison of the Japanese eel and me- 
daka genomes to clarify the details of the chromosomal re- 
arrangement events soon after the TGD. According to the 
ancestral genomic structures of zebrafish and medaka, we 
re-arrayed the medaka chromosomes in the Oxford grid 
between the eel and medaka (Figure 3B). As seen in the 
synteny analysis, some chromosome pairs have two-to- 
two correspondences between two species. For instance, 
most of the conserved segments on eel linkage group 3 
(Aja3) were assigned to medaka chromosome 12 (01al2) 
and its paralogous chromosome 01a9 (Figure 3C). Simi- 
larly, Ajal6 shared the conserved segments with 01al2 
and 01a9. By comparing the genomes of the eel-gar, most 
of the gar orthologs mapped on Aja3 and Ajal6 were lo- 
cated on the single chromosome Loc21; therefore, these 
linkage groups are also likely to be a paralogous 



chromosome pair generated by the TGD. These results 
suggested that the quadruplet of Aja3/Ajal6-01al2/01a9 
was a group of ohnologous chromosomes, which are de- 
rived from a single ancestral chromosome before the whole 
genome duplication [54,56,57]. The apparent ohnologous 
relationships were also observed in Aja4/Aja7-01al6/01all, 
Ajal3/Aja2-01a3/01a6 and Ajal2/Aja6-01a7/01a5. Such 
syntenic assignments that form the two-to-two correspon- 
dences among the ohnologous chromosomes are thought 
to arise for the following two reasons. First, the Japanese eel 
and medaka were diverged from their common ancestor 
soon after the TGD without undergoing many nucleotide 
changes between paralogous chromosomes of their ances- 
tor; therefore, the genomes of two species share relatively 
few nucleotide changes specific to each exact pair of 
orthologous chromosomes, which leads to vague orthol- 
ogy assignments among the ohnologous chromosomes. 
Consequently, the case that partial sequence similarity be- 
tween orthologous chromosomes and their paralogous 
chromosomes is higher than that between the exact pair 
of orthologous chromosomes occurs more often in the 
comparison of the eel-medaka than that of zebrafish- 
medaka (Additional file 1: Figure S5). Second, previous 
studies of the Hox cluster in eel species implied that this 
species could have highly similar sequences between par- 
alogous chromosome pairs. Although further study of the 
eel genome sequence is necessary to identify the genome- 
wide sequence similarity between paralogous chromosome 
pairs of this species, a high similarity between paralogous 
chromosome pairs could be responsible for the increased 
frequency of the assignments between orthologous chro- 
mosomes and their paralogous chromosomes. 

The distribution pattern of the conserved sequence 
segments could help us to deduce the history of genome 
evolution soon after the TGD. Figure 4 illustrates our 
proposed scenario for part of the eight major chromo- 
somal rearrangement events at the base of the teleost 
lineage. Kasahara et al. [19] inferred that OlalO, 01al3 
and 01al4 were derived from two ancestral chromo- 
somes, and 01al4 was a fusion of one duplicate of the 
chromosomes (Figure 4A). The Oxford grid indicated 
that the conserved sequence segments located on these 
medaka chromosomes were located on eel chromosomes 
Aja8, Ajall and Ajal5 (Figure 4B). Given the same num- 
ber of chromosomes between the two species, the eel ge- 
nomes are thought to have also experienced a fusion 
event. In this case, two scenarios are generally conceiv- 
able (Figure 4C). One scenario is that the fusion event 
occurred in the common ancestral lineage before the di- 
vergence of the two species. The alternative scenario is 
that two fusion events occurred in each lineage after the 
divergence of the two species. In both scenarios, the pre- 
dictive distribution of conserved sequence segments in 
the Oxford grid would eventually become the same 
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Figure 3 Two-to-two correspondences between the Japanese eel and medaka chromosomes. (A) Teleost-specific whole genome duplication 
(TGD) and subsequent major chromosomal rearrangement events. Kasahara et al. [19] described the estimates of the teleost ancestral chromosomes, 
labeled from A to M, and the eight chromosomal rearrangements. Phylogenetic relationships among fishes and their divergence times are from 
Broughton et al. [55]. The eel lineage diverged from other teleost lineage soon after the TGD. (B) Re-arrayed the Oxford grid between the Japanese eel 
and medaka. According to the ancestral genomic structures of zebrafish-medaka, the medaka chromosomes are re-arrayed. (C) Two-to-two relationships 
among the ohnologous chromosomes. Eel linkage group 3 (Aja3) and Aja16 shared the conserved segments with medaka chromosome 12 (Ola 12) and its 
paralogous chromosome of Ola9. Aja3 and Aja 1 6 also have two-to-one relationship with the gar chromosome 21 (Loc21). Similar relationships are observed 
in Lod 1-Aja4/Aja7-Ola1 1 /Ola 16, Loc3-Aja13/Aja2-Ola3/Ola6 and Loc5-Aja12/Aja6-Ola5/Ola7. 



pattern (Figure 4C). Indeed, the distribution of con- 
served segments in the actual Oxford grid was remark- 
ably similar to that in the predictive model, even 
thought no segments were detected between Aja8 and 
01al4. We assume that the first scenario is more likely 
because it is more parsimonious compared with the al- 
ternative scenario. Likewise, Kasahara et al. [19] de- 
duced that 01a3, 01a6 and 01a23 originated from two 
ancestral chromosomes, and 01a6 was the fusion of one 
duplicate of the chromosomes (Figure 4D). These chro- 
mosomes have the correspondences to Aja2 Aja9, Ajal3 
and Ajal7 (Figure 4E). We estimated that the four eel 



chromosomes have undergone no major chromosomal 
rearrangements after the TGD, and the fusion occurred 
in the medaka lineage after the divergence of the two 
species, because the distribution of the conserved se- 
quence segments in the actual Oxford grid was coinci- 
dent with the predictive model for this scenario 
(Figure 4F). An alternative scenario is that the fusion 
occurred in the common ancestral lineage of the two 
species, and the additional fission occurred in the eel- 
specific lineage. In the predictive model for the alter- 
native fission scenario, at least one chromosome of the 
eel is presumed to have a one-to-three relationship to 
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Our proposed scenario; the fusion event occurred before the divergence of the two species. 
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Alternative scenario; the fusion events occurred in each lineage after the divergence of the two species. 
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Our proposed scenario; the fusion event occurred in the medaka lineage after the divergence of the two species. 
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Figure 4 (See legend on next page.) 
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(See figure on previous page.) 

Figure 4 Estimates of chromosomal rearrangements that occurred in the base of the teleost lineage. (A, D and G) The teleost ancestral 
chromosomes and the chromosomal rearrangement events based on the estimates of Kasahara et al. [19]. (B, E and H) The Oxford grids 
between the Japanese eel and medaka. (C, F and I) Possible scenarios of the chromosomal rearrangement events and predictive models of 
Oxford grids between the Japanese eel and medaka for each scenario. Stars represent the divergence points of the two species. In the Oxford 
grids, eel chromosomes are labeled as A and B, and their duplicates are labeled as A' and B'. Medaka chromosomes are labeled as a and b, and 
their duplicates are labeled as a' and b'. Large and small circles show conserved sequence segments between orthologous chromosomes and 
between orthologous chromosomes and their paralogous chromosomes, respectively. 

V ' ) 



medaka chromosomes: for example, eel chromosomes 
with the structure A'B', A'B'-l and AB'-2 (Figure 4F). 
Such a distribution was not observed in the actual Ox- 
ford grid. We therefore assume that the alternative fis- 
sion scenario is rather roundabout and unlikely. In 
addition, the Oxford grid also indicated the trace of the 
interchromosomal rearrangement occurred in the eel- 
specific lineage. 01a24, 01a22, 01al6 and Olall originated 
from two ancestral chromosomes (Figure 4F) and have the 
relationships with Ajal9 Aja7 and Aja4 (Figure 4G). Judg- 
ing form the distribution of the conserved sequence seg- 
ments in the Oxford grid, Aja7 was presumably generated 
by the fusion of one duplicate of the chromosomes in the 
eel-specific lineage (Figure 4H). 

Taken together, given that the medaka genome has 
preserved the ancestral genomic structures of zebrafish- 
medaka [19], our results suggested that one of the eight 
major chromosomal rearrangements occurred in the an- 
cestral lineage of eel-zebrafish-medaka after the TGD 
(-348 Mya to 284 Mya), and another fusion occurred 
after the divergence of the eel lineage and the ancestral 
lineage of zebrafish-medaka and before the divergence of 
the zebrafish and medaka lineages (284 Mya to 245 
Mya). We were unable to infer the credible scenarios for 
the other chromosomal rearrangement events because of 
a lack of eel genomic sequences available for genome- 
wide comparison with other genomes. Further genome 
sequencing for eel species will provide opportunities for 
understanding further details of the genome evolution at 
the basal level of the teleost lineage and the eel-specific 
lineage. 

Conclusions 

We constructed a second-generation linkage map of the 
Japanese eel using the ddRAD-seq approach with the 
Ion PGM sequencing platform. The maps for female 
and male spanned 1748.8 cM and 1294.5 cM, respect- 
ively, and were arranged into 19 linkage groups. Of the 
current genome assembly of the Japanese eel, 1,252 
scaffolds covering 151 Mb were assigned to the 19 link- 
age groups using 2,260 SNP markers and 82 SSR 
markers. The integration of the linkage map and the as- 
sembled sequences represents an important resource 
for future QTL studies of this species. Comparisons of 
the genomes between the Japanese eel and model fishes 



demonstrated highly conserved synteny among teleosts 
and revealed part of the genome evolution soon after 
the TGD. 

Availability of supporting data 
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The primer sequences for the SSR markers have been 
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